29 research outputs found

    Visual Analysis Algorithms for Embedded Systems

    Get PDF
    The main contribution of this thesis is the design and development of an optimized framework to realize the deep neural classifiers on the embedded platforms. Deep convolutional networks exhibit unmatched performance in image classification. However, these deep classifiers demand huge computational power and memory storage. That is an issue on embedded devices due to limited onboard resources. The computational demand of neural networks mainly stems from the convolutional layers. A significant improvement in performance can be obtained by reducing the computational complexity of these convolutional layers, making them realizable on embedded platforms. In this thesis, we proposed a CUDA (Compute Unified Device Architecture)-based accelerated scheme to realize the deep architectures on the embedded platforms by exploiting the already trained networks. All required functions and layers to replicate the trained neural networks were implemented and accelerated using concurrent resources of embedded GPU. Performance of our CUDA-based proposed scheme was significantly improved by performing convolutions in the transform domain. This matrix multiplication based convolution was also compared with the traditional approach to analyze the improvement in inference performance. The second part of this thesis focused on the optimization of the proposed framework. The flow of our CUDA-based framework was optimized using unified memory scheme and hardware-dependent utilization of computational resources. The proposed flow was evaluated over three different image classification networks on Jetson TX1 embedded board and Nvidia Shield K1 tablet. The performance of proposed GPU-only flow was compared with its sequential and heterogeneous versions. The results showed that the proposed scheme brought the higher performance and enabled the real-time image classification on the embedded platforms with lesser storage requirements. These results motivated us towards the realization of useful real-time classification and recognition problems on the embedded platforms. Finally, we utilized the proposed framework to realize the neural network-based automatic license plate recognition (ALPR) system on a mobile platform. This highly-precise and computationally demanding system was deployed by simplifying the flow of trained deep architecture developed for powerful desktop and server environments. A comparative analysis of computational complexity, recognition accuracy and inference performance was performed

    Transforming spatio-temporal self-attention using action embedding for skeleton-based action recognition

    Get PDF
    Over the past few years, skeleton-based action recognition has attracted great success because the skeleton data is immune to illumination variation, view-point variation, background clutter, scaling, and camera motion. However, effective modeling of the latent information of skeleton data is still a challenging problem. Therefore, in this paper, we propose a novel idea of action embedding with a self-attention Transformer network for skeleton-based action recognition. Our proposed technology mainly comprises of two modules as, i) action embedding and ii) self-attention Transformer. The action embedding encodes the relationship between corresponding body joints (e.g., joints of both hands move together for performing clapping action) and thus captures the spatial features of joints. Meanwhile, temporal features and dependencies of body joints are modeled using Transformer architecture. Our method works in a single-stream (end-to-end) fashion, where MLP is used for classification. We carry out an ablation study and evaluate the performance of our model on a small-scale SYSU-3D dataset and large-scale NTU-RGB+D and NTU-RGB+D 120 datasets where the results establish that our method performs better than other state-of-the-art architectures.publishedVersio

    Impact of Leader-Member Exchange Relationship and Job design on Counterproductive Work Behavior (CWB): The Role of Job Burnout

    Get PDF
    The aim of this study is to synthesize the effect of Leader-member exchange relationship (LMX) and Job design on Counterproductive work behavior (CWB) through the mediating lens of all three dimensions of Job Burnout that based upon Conservation of Resource (COR) theory. A sample size of 350 respondents was used for collecting data with the help of research survey by distributing questionnaires to the employees who are working in public sector universities of higher education sector that are currently based in twin cities (Islamabad and Rawalpindi) of Pakistan. Techniques of Simple and Multiple linear regressions were carried out for accessing mediation analysis via SPSS version 21.0 and AMOS version 27.0.  Findings of this study has revealed that LMX relationship is significantly but negatively related to Counterproductive work behavior (CWB) and Job design is significantly and positively related to (CWB). Also Job Burnout is a significant variable that mediates between Leader-member exchange for developing quality exchange relationships, job design and counterproductive work behaviors. This study gave into new insights and results into the existing body of knowledge along with practical implications and outcomes. Limitations for this study along with future directions of research are also discussed at the end

    A General-Purpose Graphics Processing Unit (GPGPU)-Accelerated Robotic Controller Using a Low Power Mobile Platform

    Get PDF
    Robotic controllers have to execute various complex independent tasks repeatedly. Massive processing power is required by the motion controllers to compute the solution of these computationally intensive algorithms. General-purpose graphics processing unit (GPGPU)-enabled mobile phones can be leveraged for acceleration of these motion controllers. Embedded GPUs can replace several dedicated computing boards by a single powerful and less power-consuming GPU. In this paper, the inverse kinematic algorithm based numeric controllers is proposed and realized using the GPGPU of a handheld mobile device. This work is the extension of a desktop GPU-accelerated robotic controller presented at DAS’16 where the comparative analysis of different sequential and concurrent controllers is discussed. First of all, the inverse kinematic algorithm is sequentially realized using Arduino-Due microcontroller and the field-programmable gate array (FPGA) is used for its parallel implementation. Execution speeds of these controllers are compared with two different GPGPU architectures (Nvidia Quadro K2200 and Nvidia Shield K1 Tablet), programmed with Compute Unified Device Architecture (CUDA) computing language. Experimental data shows that the proposed mobile platform-based scheme outperform s the FPGA by 5 and boasts a 100 speedup over the Arduino-based sequential implementation

    GPGPU Accelerated Deep Object Classification on a Heterogeneous Mobile Platform

    Get PDF
    Deep convolutional neural networks achieve state-of-the-art performance in image classification. The computational and memory requirements of such networks are however huge, and that is an issue on embedded devices due to their constraints. Most of this complexity derives from the convolutional layers and in particular from the matrix multiplications they entail. This paper proposes a complete approach to image classification providing common layers used in neural networks. Namely, the proposed approach relies on a heterogeneous CPU-GPU scheme for performing convolutions in the transform domain. The Compute Unified Device Architecture(CUDA)-based implementation of the proposed approach is evaluated over three different image classification networks on a Tegra K1 CPU-GPU mobile processor. Experiments show that the presented heterogeneous scheme boasts a 50 speedup over the CPU-only reference and outperforms a GPU-based reference by 2, while slashing the power consumption by nearly 30%

    Interval Type 2 Fuzzy Adaptive Motion Drive Algorithm Design

    Get PDF
    Motion drive algorithms are a set of filters designed to simulate realistic motion and are an integral part of contemporary vehicle simulators. This paper presents the design of a novel intelligent interval type 2 fuzzy adaptive motion drive algorithm for an off-road uphill vehicle simulator. The off-road, uphill vehicle simulator is used to train and assess the driver’s behavior under varying operational and environmental conditions in mountainous terrain. The proposed algorithm is the first of its kind to be proposed for off-road uphill vehicle simulators, and it offers numerous benefits over other motion drive algorithms. The proposed algorithm enables the simulator to adapt to changes in the uphill road surface, vehicle weight distribution, and other factors that influence off-road driving in mountainous terrain. The proposed algorithm simulates driving on hilly terrain more realistically than existing algorithms, allowing drivers to learn and practice in a safe and controlled environment. Additionally, the proposed algorithm overcomes limitations present in existing algorithms. The performance of the proposed algorithm is evaluated via test drives and compared to the performance of the conventional motion drive algorithm. The results demonstrate that the proposed algorithm is more effective than the conventional motion drive algorithm for the ground vehicle simulator. The pitch and roll responses demonstrate that the proposed algorithm has enabled the driver to experience abrupt changes in terrain while maintaining the driver’s safety. The surge response demonstrated that the proposed MDA handled the acceleration and deceleration of the vehicle very effectively. In addition, the results demonstrated that the proposed algorithm resulted in a smoother drive, prevented false motion cues, and offered a more immersive and realistic driving experience.publishedVersio

    A Gentle Introduction to Reinforcement Learning and its Application in Different Fields

    Get PDF
    Due to the recent progress in Deep Neural Networks, Reinforcement Learning (RL) has become one of the most important and useful technology. It is a learning method where a software agent interacts with an unknown environment, selects actions, and progressively discovers the environment dynamics. RL has been effectively applied in many important areas of real life. This article intends to provide an in-depth introduction of the Markov Decision Process, RL and its algorithms. Moreover, we present a literature review of the application of RL to a variety of fields, including robotics and autonomous control, communication and networking, natural language processing, games and self-organized system, scheduling management and configuration of resources, and computer vision

    Effects of a high-dose 24-h infusion of tranexamic acid on death and thromboembolic events in patients with acute gastrointestinal bleeding (HALT-IT): an international randomised, double-blind, placebo-controlled trial

    Get PDF
    Background: Tranexamic acid reduces surgical bleeding and reduces death due to bleeding in patients with trauma. Meta-analyses of small trials show that tranexamic acid might decrease deaths from gastrointestinal bleeding. We aimed to assess the effects of tranexamic acid in patients with gastrointestinal bleeding. Methods: We did an international, multicentre, randomised, placebo-controlled trial in 164 hospitals in 15 countries. Patients were enrolled if the responsible clinician was uncertain whether to use tranexamic acid, were aged above the minimum age considered an adult in their country (either aged 16 years and older or aged 18 years and older), and had significant (defined as at risk of bleeding to death) upper or lower gastrointestinal bleeding. Patients were randomly assigned by selection of a numbered treatment pack from a box containing eight packs that were identical apart from the pack number. Patients received either a loading dose of 1 g tranexamic acid, which was added to 100 mL infusion bag of 0·9% sodium chloride and infused by slow intravenous injection over 10 min, followed by a maintenance dose of 3 g tranexamic acid added to 1 L of any isotonic intravenous solution and infused at 125 mg/h for 24 h, or placebo (sodium chloride 0·9%). Patients, caregivers, and those assessing outcomes were masked to allocation. The primary outcome was death due to bleeding within 5 days of randomisation; analysis excluded patients who received neither dose of the allocated treatment and those for whom outcome data on death were unavailable. This trial was registered with Current Controlled Trials, ISRCTN11225767, and ClinicalTrials.gov, NCT01658124. Findings: Between July 4, 2013, and June 21, 2019, we randomly allocated 12 009 patients to receive tranexamic acid (5994, 49·9%) or matching placebo (6015, 50·1%), of whom 11 952 (99·5%) received the first dose of the allocated treatment. Death due to bleeding within 5 days of randomisation occurred in 222 (4%) of 5956 patients in the tranexamic acid group and in 226 (4%) of 5981 patients in the placebo group (risk ratio [RR] 0·99, 95% CI 0·82–1·18). Arterial thromboembolic events (myocardial infarction or stroke) were similar in the tranexamic acid group and placebo group (42 [0·7%] of 5952 vs 46 [0·8%] of 5977; 0·92; 0·60 to 1·39). Venous thromboembolic events (deep vein thrombosis or pulmonary embolism) were higher in tranexamic acid group than in the placebo group (48 [0·8%] of 5952 vs 26 [0·4%] of 5977; RR 1·85; 95% CI 1·15 to 2·98). Interpretation: We found that tranexamic acid did not reduce death from gastrointestinal bleeding. On the basis of our results, tranexamic acid should not be used for the treatment of gastrointestinal bleeding outside the context of a randomised trial

    Effect of Core Self-Evaluation on Employee's Outcomes: The Role of Mentorship

    No full text
    The study aims to examine the relationship between Core self-evaluation on employee’s outcomes (Work engagement, Work Success and Intention to leave) with moderation role of mentorship. The study investigates the relationship in Pakistani environment, particularly the employees in public sector universities of twin cities (Rawalpindi & Islamabad) of Pakistan. Data was collected through the questionnaire distributed among the 440 employees of different universities in capital city (Rawalpindi and Islamabad) of Pakistan. The responses obtained, their assessment done, passed through the statistical programs by using SPSS (23 version) to obtain the findings. Descriptive statistical methods (e.g. frequency, average, standard deviation) have been utilized during the appraisal of the data. The consequences explored that CSA is intensely linked to employee’s outcomes, the current research also empirically investigated that mentorship has moderating effect in the relationship between CSA and employee’s outcomes. This research is amongst those rare studies conducted in Pakistani environment that have inspected effect of CSA on employee’s outcome and also inspected the moderating effect of mentorship between the relationship of core self-evaluation and employee’s outcomes
    corecore